Goto

Collaborating Authors

 Tottori Prefecture


A Survey of Explainable Reinforcement Learning: Targets, Methods and Needs

Saulières, Léo

arXiv.org Artificial Intelligence

The success of recent Artificial Intelligence (AI) models has been accompanied by the opacity of their internal mechanisms, due notably to the use of deep neural networks. In order to understand these internal mechanisms and explain the output of these AI models, a set of methods have been proposed, grouped under the domain of eXplainable AI (XAI). This paper focuses on a sub-domain of XAI, called eXplainable Reinforcement Learning (XRL), which aims to explain the actions of an agent that has learned by reinforcement learning. We propose an intuitive taxonomy based on two questions "What" and "How". The first question focuses on the target that the method explains, while the second relates to the way the explanation is provided. We use this taxonomy to provide a state-of-the-art review of over 250 papers. In addition, we present a set of domains close to XRL, which we believe should get attention from the community. Finally, we identify some needs for the field of XRL.


Inter-Passage Verification for Multi-evidence Multi-answer QA

Chen, Bingsen, Wang, Shengjie, Ye, Xi, Zhao, Chen

arXiv.org Artificial Intelligence

Multi-answer question answering (QA), where questions can have many valid answers, presents a significant challenge for existing retrieval-augmented generation-based QA systems, as these systems struggle to retrieve and then synthesize a large number of evidence passages. To tackle these challenges, we propose a new multi-answer QA framework -- Retrieval-augmented Independent Reading with Inter-passage Verification (RI$^2$VER). Our framework retrieves a large set of passages and processes each passage individually to generate an initial high-recall but noisy answer set. Then we propose a new inter-passage verification pipeline that validates every candidate answer through (1) Verification Question Generation, (2) Gathering Additional Evidence, and (3) Verification with inter-passage synthesis. Evaluations on the QAMPARI and RoMQA datasets demonstrate that our framework significantly outperforms existing baselines across various model sizes, achieving an average F1 score improvement of 11.17%. Further analysis validates that our inter-passage verification pipeline enables our framework to be particularly beneficial for questions requiring multi-evidence synthesis.


Zero-Shot Keyphrase Generation: Investigating Specialized Instructions and Multi-Sample Aggregation on Large Language Models

Mohan, Jayanth, Chowdhury, Jishnu Ray, Malik, Tomas, Caragea, Cornelia

arXiv.org Artificial Intelligence

Keyphrases are the essential topical phrases that summarize a document. Keyphrase generation is a long-standing NLP task for automatically generating keyphrases for a given document. While the task has been comprehensively explored in the past via various models, only a few works perform some preliminary analysis of Large Language Models (LLMs) for the task. Given the impact of LLMs in the field of NLP, it is important to conduct a more thorough examination of their potential for keyphrase generation. In this paper, we attempt to meet this demand with our research agenda. Specifically, we focus on the zero-shot capabilities of open-source instruction-tuned LLMs (Phi-3, Llama-3) and the closed-source GPT-4o for this task. We systematically investigate the effect of providing task-relevant specialized instructions in the prompt. Moreover, we design task-specific counterparts to self-consistency-style strategies for LLMs and show significant benefits from our proposals over the baselines.


Biohybrid Microrobots Based on Jellyfish Stinging Capsules and Janus Particles for In Vitro Deep-Tissue Drug Penetration

Park, Sinwook, Barak, Noga, Lotan, Tamar, Yossifon, Gilad

arXiv.org Artificial Intelligence

Microrobots engineered from self-propelling active particles, extend the reach of robotic operations to submillimeter dimensions and are becoming increasingly relevant for various tasks, such as manipulation of micro/nanoscale cargo, particularly targeted drug delivery. However, achieving deep-tissue penetration and drug delivery remain a challenge. This work developed a novel biohybrid microrobot consisting of jellyfish stinging capsules, which act as natural nanoinjectors for efficient penetration and delivery, assembled onto an active Janus particle (JP). While microrobot transport and navigation was externally controlled by magnetic field-induced rolling, capsule loading onto the JP surface was controlled by electric field. Following precise navigation of the biohybrid microrobots to the vicinity of target tissues, the capsules were activated by a specific enzyme introduced to the solution, which then triggered tubule ejection and release of the preloaded molecules. Use of such microrobots for penetration of and delivery of the preloaded drug/toxin to targeted cancer spheroids and live Caenorhabditis elegans was demonstrated in-vitro. The findings offer insights for future development of bio-inspired microrobots capable of deep penetration and drug delivery. Future directions may involve encapsulation of various drugs within different capsule types for enhanced versatility. This study may also inspire in-vivo applications involving deep tissue drug delivery.


Speechworthy Instruction-tuned Language Models

Cho, Hyundong, Jedema, Nicolaas, Ribeiro, Leonardo F. R., Sharma, Karishma, Szekely, Pedro, Moschitti, Alessandro, Janssen, Ruben, May, Jonathan

arXiv.org Artificial Intelligence

Current instruction-tuned language models are exclusively trained with textual preference data and thus are often not aligned with the unique requirements of other modalities, such as speech. To better align language models with the speech domain, we explore (i) prompting strategies grounded in radio-industry best practices and (ii) preference learning using a novel speech-based preference data of 20K samples, generated with a wide spectrum of prompts that induce varying dimensions of speech-suitability and labeled by annotators who listen to response pairs. Both human and automatic evaluation show that both prompting and preference learning increase the speech-suitability of popular instruction-tuned LLMs. Interestingly, we find that prompting and preference learning can be additive; combining them achieves the best win rates in head-to-head comparison, resulting in responses that are preferred or tied to the base model in 76.2% of comparisons on average. Lastly, we share lexical, syntactical, and qualitative analyses to showcase how each method contributes to improving the speech-suitability of generated responses.


Deciphering interventional dynamical causality from non-intervention systems

Shi, Jifan, Li, Yang, Zhao, Juan, Leng, Siyang, Aihara, Kazuyuki, Chen, Luonan, Lin, Wei

arXiv.org Machine Learning

Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational criterion, Interventional Embedding Entropy (IEE), to quantify causality. The IEE criterion theoretically and numerically enables the deciphering of IntDC solely from observational (non-interventional) time-series data, without requiring any knowledge of dynamical models or real interventions in the considered system. Demonstrations of performance showed the accuracy and robustness of IEE on benchmark simulated systems as well as real-world systems, including the neural connectomes of C. elegans, COVID-19 transmission networks in Japan, and regulatory networks surrounding key circadian genes.


Principal Component Analysis as a Sanity Check for Bayesian Phylolinguistic Reconstruction

Murawaki, Yugo

arXiv.org Artificial Intelligence

Bayesian approaches to reconstructing the evolutionary history of languages rely on the tree model, which assumes that these languages descended from a common ancestor and underwent modifications over time. However, this assumption can be violated to different extents due to contact and other factors. Understanding the degree to which this assumption is violated is crucial for validating the accuracy of phylolinguistic inference. In this paper, we propose a simple sanity check: projecting a reconstructed tree onto a space generated by principal component analysis. By using both synthetic and real data, we demonstrate that our method effectively visualizes anomalies, particularly in the form of jogging.


Shape-programmable Adaptive Multi-material Microrobots for Biomedical Applications

Tan, Liyuan, Yang, Yang, Fang, Li, Cappelleri, David J.

arXiv.org Artificial Intelligence

Abstract: Flagellated microorganisms can swim at low Reynolds numbers and adapt to changes in their environment. Specifically, the flagella can switch their shapes or modes through gene expression. In the past decade, efforts have been made to fabricate and investigate rigid types of microrobots without any adaptation to the environments. More recently, obtaining adaptive microrobots mimicking real microorganisms is getting more attention. However, even though some adaptive microrobots achieved by hydrogels have emerged, the swimming behaviors of the microrobots before and after the environment-induced deformations are not predicted in a systematic standardized way. In this work, experiments, finite element analysis, and dynamic modeling are presented together to realize a complete understanding of these adaptive microrobots. The above three parts are cross-verified proving the success of using such methods, facilitating the bio-applications with shape-programmable and even swimming performance-programmable microrobots. Moreover, an application of targeted object delivery using the proposed microrobot has been successfully demonstrated. Finally, cytotoxicity tests are performed to prove the potential for using the proposed microrobot for biomedical applications. One-Sentence Summary: A systematic approach to design shape-programable, dual-function, and adaptive microrobots for biomedical applications. Main Text: INTRODUCTION Microorganisms are capable of swimming with flagella to provide motility (1-3). These microorganisms can adapt their flagella into different shapes or modes by altering gene expression to accommodate environmental changes or for other proposes like nutrition, hosting, and invasion (4). For example, the flagella of a spermatozoon of Echinus esculentus will result in a transition from a planar to a helical shape when the viscosity is increased and back to a quasi-planar shape when it is further increased (5). Moreover, recent investigations show that the flagella can deform to wrap around the cell body to escape from traps or to enhance the efficiency of environmental exploration (6, 7). Inspired by these natural living beings, many microrobots have been fabricated to swim in this microscale world. The two strategies most adopted to achieve motility are the helical structures mimicking the flagella of bacterial E. coli and the flexible body replicating the motion of a spermatozoa (8). In the last decade, various helical-type microrobots are realized with fixed shapes, i.e., the structure will not change once it is fabricated (9-11).


The Anatomy Spread of Online Opinion Polarization: The Pivotal Role of Super-Spreaders in Social Networks

Kawahata, Yasuko

arXiv.org Artificial Intelligence

The study investigates the role of 'superspreaders' in shaping opinions within networks, distinguishing three types: A, B, and C. Type A has a significant influence in shaping opinions, Type B acts as a counterbalance to A, and Type C functions like media, providing an objective viewpoint and potentially regulating A and B's influence. The research uses a confidence coefficient and z-score to survey superspreaders' behaviors, with a focus on the conditions affecting group dynamics and opinion formation, including environmental factors and forgetfulness over time. The findings offer insights for improving online communication security and understanding social influence.


Perspective in Opinion Dynamics on Complex Convex Domains of Time Networks for Addiction, Forgetting

Kawahata, Yasuko

arXiv.org Artificial Intelligence

This paper revises previous work and introduces changes in spatio-temporal scales. The paper presents a model that includes layers A and B with varying degrees of forgetting and dependence over time. We also model changes in dependence and forgetting in layers A, A', B, and B' under certain conditions. In addition, to discuss the formation of opinion clusters that have reinforcing or obstructive behaviors of forgetting and dependence and are conservative or brainwashing or detoxifying and less prone to filter bubbling, new clusters C and D that recommend, obstruct, block, or incite forgetting and dependence over time are Introduction. This introduction allows us to test hypotheses regarding the expansion of opinions in two dimensions over time and space, the state of development of opinion space, and the expansion of public opinion. Challenges in consensus building will be highlighted, emphasizing the dynamic nature of opinions and the need to consider factors such as dissent, distrust, and media influence. The paper proposes an extended framework that incorporates trust, distrust, and media influence into the consensus building model. We introduce network analysis using dimerizing as a method to gain deeper insights. In this context, we discuss network clustering, media influence, and consensus building. The location and distribution of dimers will be analyzed to gain insight into the structure and dynamics of the network. Dimertiling has been applied in various fields other than network analysis, such as physics and sociology. The paper concludes by emphasizing the importance of diverse perspectives, network analysis, and influential entities in consensus building. It also introduces torus-based visualizations that aid in understanding complex network structures.